Estimation of absolute depth from polarization measurements

ABSTRACT

A head mounted display comprises an eye tracking system configured to enable eye tracking using polarization. The eye tracking system includes one or more illumination sources and an optical detector comprising polarization sensitive pixels. The one or more illumination sources are configured to illuminate a user&#39;s eye and generate reflections directed towards the optical detector. The eye tracking system determines, for each polarization sensitive pixel in a subset of the polarization sensitive pixels, one or more estimation parameters. The eye tracking system determines, for the subset of the polarization sensitive pixels, depth information for one or more glints associated with one or more surfaces of the eye, based in part on the polarization of the reflections and the one or more estimation parameters. The determined depth information is used to update a model of the eye. The eye tracking system determines eye tracking information based on the updated model.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of co-pending U.S. application Ser. No. 16/381,549, filed Apr. 11, 2019, which is a continuation of U.S. application Ser. No. 15/808,758, filed Nov. 9, 2017, now U.S. Pat. No. 10,311,584, each of which is incorporated by reference in its entirety.

BACKGROUND

The present disclosure generally relates to depth sensing, and specifically relates to estimation of absolute depth from polarization measurements.

Eye tracking is an important feature for head-mounted display (HMD) systems including systems used in artificial reality applications. An essential part of many features required to achieve compelling artificial reality relies on an accurate way to track optical axes of users' eyes. For example, a preferred solution to the vergence-accommodation problem is expected to rely critically on eye tracking so that a display can move to accommodate the expected depth of field that corresponds to the natural human vergence at any given time. Another example is foveated rendering, the technique that displays a variable resolution image on a display to match the variable spatial resolution capability of the human eye as the eye rotates to different positions around the display area. However, the problem is that human eyes rotate over a wide range of angles (e.g., up to 55 degrees) in all directions and can rotate very quickly and unpredictably, which limits capabilities of known techniques.

Conventional tracking systems that track features of the human eye often utilize a patterned illumination source (e.g., dots, lines) in conjunction with a standard imaging camera to determine a location of glints reflected off the eyes (i.e., the virtual image of the light source as observed behind the eye's surface). With enough glints detected, and a spherical assumption for the shape of the eyeball and cornea, an approximate eyeball geometry can be inferred. In this case, the optical axis is assumed to be perpendicular to the front center surface of the identified cornea. However, human eyes differ widely between people, often violating the spherical assumption and exhibit complicated sub-surface scattering events that can obscure sensitive measurement techniques that estimate the outer surface of the eye directly (e.g., using structured light, time-of-flight depth cameras, etc.).

Thus, the conventional tracking systems are typically limited by the quality of the optical path and do not provide sufficient accuracy needed for eye tracking in a HMD system.

SUMMARY

A head mounted display (HMD) comprises an eye tracking system that tracks eye(s) of a user of the HMD based in part on polarization of light reflected from the user's eye(s). The eye tracking system includes one or more illumination sources configured to illuminate a user's eye using light, an optical detector to capture polarized light reflecting from at least one surface of the user's eye, and a controller. In an embodiment, the one or more illumination sources can illuminate the user's eye with a certain periodicity. The optical detector and the one or more illumination sources are positioned relative to each other such that the optical detector is able to capture light emitted by the one or more illumination sources and reflected from one or more surfaces of the user's eye (hereinafter referred to as “eye reflections” or “glints”). The optical detector comprises an array of a plurality of polarization sensitive pixels and is able to determine the polarization state (e.g., polarization angle) of the eye reflections.

In some embodiments, the optical detector captures one or more images of the user's eye. The optical detector further determines intensity and polarization information associated with the one or more images. The controller determines, for each polarization sensitive pixel in a subset of the plurality of polarization sensitive pixels, one or more estimation parameters associated with the eye tracking system. The controller determines, for the subset of the plurality of polarization sensitive pixels, depth information for one or more glints associated with one or more surfaces of a plurality of surfaces of the eye, based in part on the determined polarization information and the one or more estimation parameters. A glint can refer to a reflection of one of the illumination sources (e.g., a point source) from a surface of an eye, i.e., to a virtual image of the point source as observed behind the eye's surface. The controller is configured to update a model of the eye based on the determined depth information for the one or more glints. The controller is further configured to determine various types of eye tracking information such as a user's gaze direction, vergence angle/depth, and accommodation depth, based on the updated model of the eye.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1A is a diagram of a head-mounted display (HMD), in accordance with an embodiment.

FIG. 1B is a cross section of a front rigid body of the HMD in FIG. 1A, in accordance with an embodiment.

FIG. 2 depicts a block diagram of an eye tracking system, in accordance with an embodiment.

FIG. 3 depicts an example operation of the eye tracking system of FIG. 2, in accordance with an embodiment.

FIG. 4 depicts an example system diagram for determining a depth of an eye's surface, in accordance with an embodiment.

FIG. 5 is a flow chart illustrating an eye tracking process, in accordance with an embodiment.

FIG. 6 is a block diagram of a HMD system in which a console operates, in accordance with an embodiment.

The figures depict embodiments of the present disclosure for purposes of illustration only. One skilled in the art will readily recognize from the following description that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles, or benefits touted, of the disclosure described herein.

DETAILED DESCRIPTION

Embodiments of the present disclosure may include or be implemented in conjunction with an artificial reality system. Artificial reality is a form of reality that has been adjusted in some manner before presentation to a user, which may include, e.g., a virtual reality (VR), an augmented reality (AR), a mixed reality (MR), a hybrid reality, or some combination and/or derivatives thereof. Artificial reality content may include completely generated content or generated content combined with captured (e.g., real-world) content. The artificial reality content may include video, audio, haptic feedback, or some combination thereof, and any of which may be presented in a single channel or in multiple channels (such as stereo video that produces a three-dimensional effect to the viewer). Additionally, in some embodiments, artificial reality may also be associated with applications, products, accessories, services, or some combination thereof, that are used to, e.g., create content in an artificial reality and/or are otherwise used in (e.g., perform activities in) an artificial reality. The artificial reality system that provides the artificial reality content may be implemented on various platforms, including a head-mounted display (HMD) connected to a host computer system, a standalone HMD, a mobile device or computing system, or any other hardware platform capable of providing artificial reality content to one or more viewers.

An eye tracking system presented herein which may be integrated into a HMD includes an illumination system and a polarization sensitive camera. The illumination system (e.g., array of sources) projects unpolarized light onto at least a portion of an eye. The term “unpolarized light” refers hereinafter to randomly polarized light. For randomly polarized light, each individual photon has a polarization state independent of each other providing that the population of photons in the light is randomly polarized. Polarization of light reflected by at least one surface of the eye varies in part on an angle of incidence. The polarization sensitive camera detects the reflected light and its associated polarization state. The eye tracking system generates and further updates a model of the eye. The model of the eye includes depth information for one or more glints associated with one or more surfaces of the eye using images captured by the polarization sensitive camera and known locations from the polarization sensitive camera to corresponding sources in the illumination system. A glint can be hereinafter referred to a reflection of one of illumination sources (e.g., a point source) from a surface of an eye, i.e., to a virtual image of the point source as observed behind the eye's surface. The eye tracking system determines eye tracking information including an orientation of the eye, based on the updated model of the eye.

In some embodiments, the eye tracking system is integrated into a HMD for determining eye tracking information related to one or both eyes of a user wearing the HMD. The HMD displays content to a user. The HMD may be part of an artificial reality system. The HMD further includes an electronic display and an optical assembly. The electronic display is configured to emit image light. The optical assembly is configured to direct the image light to an exit pupil of the HMD corresponding to a location of the user's eye determined by the eye tracking system. The HMD can further adjust resolution and/or focus of images displayed on the electronic display, based at least in part on the determined eye tracking information.

FIG. 1A is a diagram of a HMD 100, in accordance with an embodiment. The HMD 100 may be part of an artificial reality system. In embodiments that describe AR system and/or a MR system, portions of a front side 102 of the HMD 100 are at least partially transparent in the visible band (˜380 nm to 750 nm), and portions of the HMD 100 that are between the front side 102 of the HMD 100 and an eye of the user are at least partially transparent (e.g., a partially transparent electronic display). The HMD 100 includes a front rigid body 105 and a band 110.

The front rigid body 105 includes one or more electronic display elements (not shown in FIG. 1A), one or more integrated eye tracking systems (not shown in FIG. 1A), an Inertial Measurement Unit (IMU) 130, one or more position sensors 135, one or more locators 120, and a reference point 115. The locators 120 are located in fixed positions on the front rigid body 105 relative to one another and relative to a reference point 115. In the example of FIG. 1A, the reference point 115 is located at the center of the IMU 130. In various embodiments, each of the locators 120 emit light that is detectable by an external imaging device.

In the embodiment shown by FIG. 1A, the position sensors 135 are located within the IMU 130, and neither the IMU 130 nor the position sensors 135 are visible to a user of the HMD 100. The IMU 130 is an electronic device that generates fast calibration data based on measurement signals received from one or more of the position sensors 135. A position sensor 135 generates one or more measurement signals in response to motion of the HMD 100. Examples of position sensors 135 include: one or more accelerometers, one or more gyroscopes, one or more magnetometers, another suitable type of sensor that detects motion, a type of sensor used for error correction of the IMU 130, or some combination thereof. The position sensors 135 may be located external to the IMU 130, internal to the IMU 130, or some combination thereof.

FIG. 1B is a cross section 125 of the front rigid body 105 of the embodiment of a HMD 100 shown in FIG. 1A. As shown in FIG. 1B, the front rigid body 105 provides altered image light to an eye-box 150. The front rigid body 105 includes an eye-box 150, a display unit 160, an optics block 165, an eye tracking system 175, and an illumination source 176. While, depicted separate from the eye tracking system 175 in FIG. 1B, in some embodiments, the eye tracking system 175 includes, optionally, the illumination source 176. The eye-box 150 is a region in space that an eye of a user occupies while wearing the HMD 100, i.e., a region in space where an eye 180 is positioned. For purposes of illustration, FIG. 1B shows a cross section 125 associated with a single eye 180, but in other embodiments, the front rigid body 105 provides altered image light to another eye of the user. The eye tracking system 175 and the illumination source 176 facilitate tracking the eye 180 based on polarization information associated with the light reflected from the eye 180. The optics block 165 provides altered image light generated by the display unit 160 to the eye 180. In various embodiments, the illumination source 176 is part of the display unit 160 and the eye 180 is illuminated through the optics block 165. These elements are further described below.

The display unit 160 emits image light toward the optics block 165. In various embodiments, the display unit 160 may comprise a single electronic display or multiple electronic displays. Examples of an electronic display include: a liquid crystal display (LCD), an organic light emitting diode (OLED) display, an active-matrix organic light-emitting diode display (AMOLED), a quantum organic light emitting diode (QOLED) display, a quantum light emitting diode (QLED) display, a transparent organic light emitting diode (TOLED) display, some other display, or some combination thereof. In alternate embodiments, one or more pixels of the display unit 160 act as the illumination source 176.

The optics block 165 magnifies received light from the display unit 160, corrects optical errors associated with the image light, and the corrected image light is presented to a user of the HMD 100. The optics block 165 is an optical element, such as an aperture, a Fresnel lens, a convex lens, a concave lens, a filter, or any other suitable optical element that directs the light emitted from the display unit 160 towards the user. In other embodiments, the optics block 165 may include combinations of different optical elements. In some embodiments, one or more of the optical elements in the optics block 165 may have one or more coatings, such as partial reflectors or anti-reflective coatings.

Magnification of the image light by the optics block 165 allows the display unit 160 to be physically smaller, weigh less, and consume less power than larger displays. Additionally, magnification may increase a field of view of the displayed content. For example, the field of view of the displayed content is such that the displayed content is presented using almost all (e.g., 110° diagonal), and in some cases all, of the user's instantaneous field of view. In some embodiments, the optics block 165 is designed so its effective focal length is larger than the spacing to the display unit 160, which magnifies the image light projected by the display unit 160. Additionally, in some embodiments, the amount of magnification may be adjusted by adding or removing optical elements.

The optics block 165 may be designed to correct one or more types of optical error. Examples of optical error include: distortion, chromatic aberration, spherical aberration, field curvature, astigmatism, or any other type of optical aberrations. In some embodiments, content provided to the display unit 160 for display is pre-distorted, and the optics block 165 corrects the distortion when it receives image light from the display unit 160 generated based on the content.

The eye tracking system 175 updates a model of the eye 180 and determines eye tracking information associated with the eye 180. The eye tracking system 175 may include one or more optical detectors and optionally includes the illumination source 176. The optical detectors capture eye reflections from the eye 180 and determine a polarization state of the eye reflections. Based on the determined polarization state of the eye reflections, the eye tracking system 175 determines a depth map of the corneal surface and updates a model of the eye 180. As the user looks at various objects within the HMD 100, the eye tracking system 175 is configured to determine the user's gaze direction, vergence angle/depth, and accommodation depth.

The illumination source 176 illuminates the eye 180 with light. In some embodiments, the illumination source 176 may be external to the eye tracking system 175. In alternate embodiments, the illumination source 176 may be integrated into the eye tracking system 175. The illumination source 176 comprises one or more optical sources configured to illuminate the eye 180 with light. The illumination source 176 may emit light in the visible band (˜380 nm to 750 nm), the IR band (˜750 nm to ˜1,800 nm), some other band of light, or some combination thereof. The illumination source 175 may be a laser (e.g., a tunable laser, a continuous wave laser, a pulse laser, other suitable laser emitting light), a light emitted diode (LED), a fiber light source, another other suitable light source emitting infrared and/or visible light, or some combination thereof. Additionally, the illumination source 176 may emit light that is unpolarized, linearly polarized, circularly polarized, or any combination thereof. In various embodiments, the one or more illumination sources 176 are located off-axis to not obstruct a user's view of the display unit 160. Each of the one or more illumination sources 176 (also referred to herein as illuminator) illuminates a portion of the eye 180 with light at a predetermined illumination power. The predetermined illumination power is less than a threshold that causes injury to the eye. In an embodiment, the one or more illumination sources 176 periodically illuminate the portion of the eye 180.

As noted above, in alternate embodiments, the illumination source 176 is part of the display unit 160. That is, the illumination source 176 may comprise one or more LEDs which are integrated into the electronic display associated with the display unit 160. In these embodiments, the display unit 160 is configured to periodically illuminate the eye 180. For example, if the refresh rate of the electronic display associated with the display unit 160 is 60 Hertz, the display unit 160 may be configured to illuminate the eye 180 for 30 milliseconds during 1 second of dead time between consecutive electronic display activations.

The light emitted by the illumination source 176 is incident on a portion of the eye 180 from a wide range of angles. In embodiments where the illumination source 176 is part of the display unit 160, this is easily achieved. However, in embodiments, where the illumination source 176 is separate and distinct from the display unit 160, the illumination source 176 is additionally coupled to a diffuser and the illumination source 176 is configured to produce multiple reflections within the HMD 100. Materials of the HMD 100 and illuminator 176 may be chosen to enable diffuse, or specular, reflections, thereby maintaining the polarization state (e.g., unpolarized light) featured before reflection. For example, in the embodiment described above, materials for the HMD 100 and/or the illuminator 176 are chosen in order to ensure that the reflected light remains unpolarized. It should be noted that in one or more embodiments, the light emitted by the illumination sources 176 is incident on the eye from every possible angle [0, 2π].

A portion of the eye 180 to be illuminated by the illumination source 176 may be selected for detection of a change in polarization upon reflection. That is, portions of the eye 180 that result in a change in polarization upon reflection during eye movement are selected for illumination by the illumination source 176. For example, the illuminated portion of the eye 180 may have a maximum reflectivity. The illuminated portion may be located on, e.g., an edge of the sclera, surface of the cornea, a limbus (e.g., junction of the cornea and sclera, a junction of the iris and the sclera, a junction of the iris and pupil, or any other suitable junction in the eye).

The light reflected from the eye 180 is captured by the eye tracking system 175. In various embodiments, at the boundary between the corneal surface and air, the light from the illumination source 176 is reflected towards the eye tracking system 175. The shape (e.g., radius of curvature) of the corneal surface causes small changes in the polarization state of the reflected light. In one or more embodiments, un-polarized light incident on a corneal surface is reflected off of the corneal surface towards the eye tracking system 175. The incident un-polarized light (e.g., emitted from the illumination source 176) that is reflected off of the corneal surface of the eye 180 is partially polarized. For example, s- and p-polarized light has different reflection coefficients and light reflected towards the eye tracking system 175 have a different intensity based on at least, a polarization angle and a polarization state. The intensity of light captured at each polarization angle (e.g., 0°, 45°, 90°, and 135°) is related to the size and shape of the corneal surface. In other embodiments, light captured by the eye tracking system 175 includes light polarized at an arbitrary polarization angle between 0° and 360°. In one or more embodiments, the eye tracking system 175 generates an image of the corneal surface from intensities captured at three or more polarization angles. For example, in the example above, the eye tracking system 175 determines an image of the corneal surface at polarization angles of 0°, 45°, 90°, and 135°. The eye tracking system 175 is further described below in conjunction with FIG. 2.

The eye tracking system 175 determines depth information for at least one glint associated with at least one eye surface (e.g., pupil, corneal surface, sclera, etc.) based on the one or more captured images of the at least one eye surface and one or more estimation parameters associated with the eye tracking system 175. The captured images of the at least one eye surface include intensity and polarization information. Some, or all, of the eye tracking system 175 may or may not be in a line of sight of a user wearing the HMD 100. The eye tracking system 175 is typically located off-axis to avoid obstructing the user's view of the display unit 160. In other embodiments, the eye tracking unit 175 may alternately be placed elsewhere. There is at least one eye tracking system 175 for the user's left eye and at least one tracking unit for the user's right eye. However, it should be noted that in still other embodiments, one eye tracking system 175 may track both the left and right eye of the user. In some embodiments, the eye tracking system 175 and/or the illumination source 176, both, utilize the optics block 165 to image and illuminate the eye 180. For example, the optics block 165 may include a hot mirror positioned to receive IR light from the illumination source and reflect the received eye reflections towards to the eye tracking system 175.

The eye tracking system 175 updates a model of the eye 180 based in part on the determined depth information. In some embodiments, the eye tracking system 175 updates the model of the eye by utilizing the depth information associated with each glint to fit a model component (e.g., model surface) in the eye model to a surface of the eye (e.g., pupil, cornea, sclera, etc.) to update the model of the eye. In various embodiments, the eye tracking system 175 determines eye tracking information based at least in part on the updated eye model. Eye tracking information describes a position and/or an orientation of the eye 180. The eye tracking system 175 is coupled to a polarization sensitive camera including one or more illumination sources to perform the necessary data processing for generating the eye model and determining eye tracking information. Eye tracking information determined by the eye tracking system 175 includes a gaze direction (also referred to a direction of a fovea axis that is an axis that bisects a center of a fovea and a center of an eye's pupil), a gaze position (also referred to as an fixation point where the user is looking), gaze time (how long the user is looking a particular direction), a vergence angle (an angle between two eyes when the user changes viewing distance and gaze direction), an inter-pupillary distance (IPD, defined as the distance between the center of the pupils of the two eyes) of the user. For example, the eye tracking system 175 can determine IPD by estimating eye position when the user is focused at infinity or, e.g., another object far from the user. In the example above, the eye tracking system 175 determines a gaze direction by extrapolating a direction based on the estimated position and determined IPD. The eye tracking system 175 determines vergence angle by estimating changes of a user is viewing distance and gaze direction. The eye tracking system 175 is able to determine the foveal axis, the orientation the orientation of the foveal axis from a pupillary axis, and a change in the shape of the eye 180.

Based on the determined and tracked position and orientation of the eye 180 (i.e., eye-gaze), the HMD 100 may adjust presentation of an image displayed on the display unit 160. In some embodiments, the HMD 100 may adjust resolution of the displayed image based on the eye tracking information. In an example embodiment, a maximum pixel density for displaying an image on the display unit 160 can be provided only in a foveal region of the determined eye-gaze, whereas a lower resolution display is employed in other regions, without negatively affecting the user's visual experience. The eye tracking system 175 is further described, below, in conjunction with FIG. 2.

FIG. 2 depicts a block diagram of an eye tracking system 200 in accordance with an embodiment. The eye tracking system 200 determines eye tracking information for an eye associated with a user (e.g., eye 180). In one or more embodiments, the eye tracking information is determined based on polarization information associated with light reflected and/or scattered from the eye. Eye tracking information includes information about a position and orientation of the user's eye in an eye-box (i.e., information about an angle of an eye-gaze). The eye tracking system 200 includes a data store 210, an illumination source 220, a camera 230, and a controller 240. In other embodiments, the eye tracking system 200 may contain additional or fewer elements than those discussed herein. In some embodiments, the eye tracking system 200 is an embodiment of the eye tracking system 175. In other embodiments, the eye tracking system 200 is a component of some other system that performs eye tracking.

The data store 210 stores one or more models of eyes (e.g., eye 180) of users. In an embodiment, a stored model of an eye in the data store 210 comprises two spheres. Together these spheres represent the eye associated with the user including the user's cornea. In one or more embodiments, the model of the eye is parameterized by characteristics of the two spheres. For example, the model of the eye is parameterized by the radius of each sphere, a location of the centers of the two spheres, and a distance between the centers of these spheres. The values of the model parameters associated with a given model are stored within the data store 210. The model and the model parameters are stored in the data store 210 as a <key, value> pair. In an embodiment, the key is the name of a model and value represents a model parameter associated with the model, respectively. A model parameter may describe, e.g., a radius of a corneal sphere, a center location of the corneal sphere, a radius of an eye sphere, a center of the eye sphere, a pupil diameter, a pupil location, optical axis, or some other parameter associated with a model of the eye. As is described in detail below with regard to FIG. 3, an eye may be modeled using two spheres. An eye sphere, a portion of which generally maps to the sclera of the eye, and a corneal sphere, a portion of which describes a corneal surface of the eye. In one or more embodiments, the model of an eye is segmented into one or more parts (i.e., iris, sclera, pupil, etc.). Here, each of the segmented portions of the eye model are also stored in the data store 210 as <key, value> pairs. In some embodiments, each of the model parameters associated with a model is stored as a key value pair such as <key, value1, value2>. For example, a model of the eye associated with a user named John with smaller sphere (e.g., the corneal sphere) radius of 8 mm and a larger sphere (e.g., the eye sphere) radius of 25 mm, is stored as <John, 25, 8>. In other embodiments, the data store 210 additionally stores other parameters associated with an eye model. For example, other model parameters stored in the data store 210 include sphere orientation, and sphere location. Eye models are further described below in conjunction with FIG. 3. In other embodiments, other storage schemes may additionally or alternatively be employed to store and retrieve a model of the eye in the data store 210.

The illumination source 220 illuminates the eye with unpolarized light from every possible incident angle (e.g., [0, 2π]). In various embodiments, the illumination source 220 is configured to generate corneal reflections directed towards the camera 230. The illumination source 220 includes one or more light sources that illuminate one or both eyes of the user. The illumination source 220 is an embodiment of the illumination source 176 that is described above in conjunction with FIG. 1.

The camera 230 captures one or more polarized images of the corneal surface. The captured images describe a polarization state of light reflected and/or scattered by the eye (e.g., eye reflections). In various embodiments, the camera 230 is one or more polarization sensitive cameras that are configured to detect the intensity of light at three or more different polarization angles. In still other embodiments, the camera 230 is further configured to capture linearly polarized light, circularly polarized light, or any combination thereof. In some embodiments, the camera 230 can detect the intensity of light linearly polarized at polarization angles 0°, 45°, 90°, and 135°. In other embodiments, camera 230 is configured to capture light linearly polarized at one or more other polarization angles. That is, in another example the camera 230 is configured to capture light linearly polarized at polarization angles of 0°, 15°, 30°, and 60°.

In some embodiments, the camera 230 comprises an array of polarization sensitive pixels. Here, the size of each polarization sensitive pixel is generally limited by the available CMOS production processes including the size of polarization filters. Each polarization sensitive pixel comprises a plurality of native pixels. In an example embodiment, each native pixel is approximately 1-10 micrometers (μm) in length. In the example embodiment described above, a polarization sensitive pixel comprising the native pixels is between 3-30 μm in length. In other embodiments, a polarization sensitive pixel comprises fewer or more native pixels than three native pixels. The size of the associated polarization sensitive pixel may scale with the number, and size of the native pixels comprising the polarization sensitive pixel. The one or more native pixels are arranged in a pattern (i.e., the RGB Bayer pattern) and measurements from each of the one or more native pixels are interpolated between filters in a rectangular pattern, a triangular pattern, or a square pattern. That is, a polarization sensitive pixel comprising three native pixels may be arranged such that interpolations happen triangularly, or 4-filters in a square pattern. Each native pixel includes a detector coupled to a polarization filter. In an embodiment, the polarization filter is configured to allow linearly polarized light at a particular polarization angle to pass through on to the detector while blocking linearly polarized light at other polarization angles. Said another way, each of the one or more native pixels in a polarization sensitive pixel is sensitive to linearly polarized light at a given polarization angle. The polarization filters may be composed of, e.g., polymer (e.g., film), or some other material. In one example embodiment, the camera 230 comprises an array of polarization sensitive pixels wherein each polarization sensitive pixel comprises four native pixels. Each native pixel is configured to detect linearly polarized light at a specific polarization angle. In the example embodiment described above, the four native pixels are configured to detect linearly polarized light at polarization angles 0°, 45°, 90°, and 135°.

In other embodiment, each polarization sensitive pixel is a pixel sensitive to the intensity of the incident light and the camera 230 is coupled to a dynamic polarizer. The dynamic polarizer is an optical filter that lets light waves of three or more specific polarizations pass and blocks light waves of all other polarizations. In various embodiments, the dynamic polarizer is a rotating physical polarizer, a voltage-controlled LCD film, or some other component capable of filtering light based on its polarization state. In these embodiments, the dynamic polarizer is placed somewhere in the optical path of the camera 230. For example, the dynamic polarizer is placed such that the light (e.g., corneal reflections traveling from the eye to the camera 230) must first pass through the dynamic polarizer. The camera 230 is configured to acquire the three or more polarization measurements temporally. That is, the camera 230 captures an image of the corneal surface at three or more different polarization states as the physical polarizer rotates. In one or more embodiments, the dynamic polarizer is configured to allow the passage of linearly polarized light in three or more polarization states to the array of pixels. For example, the mechanical polarization rotator is configured to allow linearly polarized light at angles of 0°, 45°, and 90° to pass through to the array of pixels. In the example, above the dynamic polarizer alternates between the polarization states to which it is permeable such that the camera 230 comprising an array of pixels operates at a speed greater than 90 frames per second (fps). That is, in the previous example, the camera 230 captures an image of the eye at a single polarization angle (e.g., one of three or more polarization angles) once every 11 milliseconds. Thus, the camera 230 captures a full frame image with a single polarization angle, reads out the entire frame, and changes the polarization state of the dynamic polarizer at least three times every frame. The rate at which the dynamic polarizer alternates between the polarization states may be controlled electronically (e.g., by the controller 240). Alternating between different polarization states may be associated with a delay (e.g., full-frame read-out delay, and reset delay). One advantage of using a dynamic polarizer coupled to the camera 230 allows for the capture of images of the corneal surface with maximal resolution, as every pixel can yield a measurement independent of its neighboring pixels.

In other embodiment, each polarization sensitive pixel of the camera 230 is comprised of multiple strained organic photovoltaic cells and/or unstrained organic photovoltaic cells. The strained and unstrained organic photovoltaic cells can preferentially absorb polarized light along one direction to measure linear polarization states of reflected light. For example, two strained organic photovoltaic cells and one unstrained photovoltaic cell can be stacked together to comprise one super pixel (i.e., one polarization sensitive pixel) of the camera 230. Fast axes of the two strained organic photovoltaic cells corresponding to maximum transmissions of the photovoltaic cells may be arranged in different angles (e.g. 0°, 45°) relative to their horizontal axes to minimize a polarimeter condition number so as to improve the error performance of the polarization sensitive pixel in the camera 230. Note that values for the minimized polarimeter condition number can be different for different photovoltaic cell parameters (e.g., transmission, diattennuation, retardance, etc.). An angle of a fast axis of a photovoltaic cell in the super pixel of the camera 230 may be optimized based on one or more photovoltaic cell parameters in order to minimize the polarimeter condition number. For example, if an appropriate organic photovoltaic material is chosen and then fabricated in a preferred manner, the 2-norm condition number may be smaller than two.

In an alternate embodiment, each polarization sensitive pixel of the camera 230 is associated with one or more charge storage bins. Here, the charge accumulated on the polarization sensitive pixel while the pixel is exposed to light is sequentially dumped in to a particular charge storage bin associated with the polarization state of the dynamic polarizer. For example, as the dynamic polarizer changes polarization state, the polarization sensitive pixel changes the charge storage bin into which it dumps charge. In another embodiment, each polarization sensitive pixel is configured to alternate the polarization angle to which it is sensitive multiple times (e.g., 0°, 120°, 240°, 0°, 120°, 240°, 0°, 120°, 240°, . . . ) The charge collected at each polarization angle is sequentially dumped into a charge storage bin associated with the polarization state of the light in order to interleave the captures. In still another embodiment, the dynamic polarizer continuously alternates the polarization angle to which it is permeable and charge collected at each polarization angle is dumped into the appropriate charge storage bin. In principle, any number of charge storage bins may be associated with a single polarization sensitive pixel. However, in practice, the number of charge storage bins may be limited by pixel size and desired dynamic range.

Examples of the camera 230 include a photodiode, a balanced/matched photodiodes, avalanche photodiode, a photomultiplier tube (PMT), a solid state detector, other suitable detector based on a single point detection, a still camera, a video camera, a linear photodiode array, a charge-coupled device (CCD) array, a complementary metal-oxide semiconductor (CMOS) array, or other suitable detector based on one or two-dimensional detector arrays. The camera 230 has a plurality of parameters such as focal length, focus, frame rate, ISO, sensor temperature, shutter speed, aperture, resolution, etc. In some embodiments, the camera 230 has a high frame rate and high resolution. The camera 230 can capture either two-dimensional images or three-dimensional images. The camera 230 is placed such that the eye reflections in response to the light from the illumination sources incident upon the eye can be captured over a range of eye movements. The camera 230 is further described, below, in conjunction with FIG. 3.

The controller 240 controls one or more components within the eye tracking system 200. In some embodiments, a single controller 240 may control the multiple eye tracking systems 200 such as one eye tracking system 200 for the left eye and another eye tracking system 200 for the right eye. The controller 240 is configured to communicate with the various modules within the eye tracking system 200 and facilitates communication between the modules comprising the eye tracking module 200. In an embodiment, the controller 240 generates one or more eye tracking instructions to control one or more of the camera 230, the illumination source 220, and the data store 210 in order to update a model of the eye 180. For example, the generated eye tracking instructions may activate the illumination source 220 including the camera 230. In other embodiments, the controller 240 is configured to retrieve one or more polarized intensity maps (e.g., images) of the corneal surface from the camera 230.

The one or more instructions may comprise instructions to modify one or more of the plurality of parameters associated with the camera 230. For example, the instructions may modify one or more of a focal length, a focus, a frame rate, an ISO, a shutter speed, an aperture, or a resolution associated with the optical detector. For example, the controller 240 generates instructions for the camera 230 to capture images of the corneal surface at a frame rate of 30 frames per second and a shutter speed of 1/400 (e.g., the shutter of the camera 230 is exposed to the eye reflections for 1/400^(th) of a second). In still other embodiments, the controller 240 generates instructions to retrieve an eye model from an HMD console. The HMD console is further described below in conjunction with FIG. 6. The method of calibrating a model is further described below. The controller 240 may also facilitate communication between the eye tracking system 200 and the HMD console.

In some embodiments, the controller 240 uses polarization information to determine a 3D shape of the eye. In various embodiments, the controller 240 determines one or more surface normals from the image of the eye captured by the camera 230. Each of the one or more surface normals is associated with a polarization sensitive pixel in the array of polarization sensitive pixels in the camera 230. A surface normal is a normalized vector (i.e., magnitude of 1) in three-dimensional space that is perpendicular to the image of the eye captured by a polarization sensitive pixel. The direction of the surface normal is defined by an azimuthal angle (i.e., angle between the X and Y axis) and a zenith angle (i.e., angle between the Z and X axis).

To determine a shape of the eye based on polarization information, the controller 240 receives light intensity received at each of the one or more polarization sensitive pixels. Typically, the intensity of the light reflected of the eye is a sinusoidal function of its polarization angle and the azimuthal angle. Thus, sampling the intensity of the linearly polarized light received from the eye at, at least, three polarization angles allows the determination of the azimuth angle of the surface normal. In an example embodiment, a polarization sensitive pixel comprises four native pixels configured to capture linearly polarized light at four different polarization angles (e.g., 0°, 45°, 90°, and 135). In a different embodiment, the dynamic polarizer associated with a camera 230 is alternated between four different polarization angles (e.g., 0°, 45°, 90°, and 135). In both of these example embodiments, each polarization sensitive pixel captures a maximum and a minimum intensity as a function of incident polarization angle to determine an azimuthal angle associated with a surface normal at that pixel.

The controller 240 determines a degree of polarization associated with the light captured at each polarization sensitive pixel in the array of polarization sensitive pixels. In one or more embodiments, the degree of polarization of the light captured at a polarization sensitive pixel is a ratio of the difference between the maximum and minimum intensities and the sum of the maximum and minimum intensities received at a given polarization sensitive pixel. Generally, the degree of polarization of an eye reflection is a function of the index of refraction and the zenith angle. In various embodiments, the controller 240 estimates a zenith angle of a surface normal using a closed form solution to a Fresnel equation relating the degree of polarization and zenith angle. In other embodiments, the controller 240 determines zenith angle by various numerical optimization techniques known to one skilled in the art.

The controller 240 may determine a 3D shape of the eye (e.g., eye 180) based on the determined surface normal at each polarization sensitive pixel. Typically, the determined surface normal at a particular polarization sensitive pixel is parallel to the gradient of the surface of the eye corresponding to that polarization sensitive pixel. Said another way, a surface f_(p) (e.g., the eye 180) corresponding to a polarization sensitive pixel p is associated with the plane L_(p). The plane L_(p) is a tangent plane of the surface f_(p) and roughly approximates the surface f_(p) near the polarization sensitive pixel p. Thus it follows that the mathematical union of the planes L_(p) roughly approximates the surface f_(p). In various embodiments, the controller 240 reduces the ambiguity associated with determining a shape of the eye by assuming that the eyeball is roughly a sphere.

Each determined surface normal uniquely identifies a plane L_(p). The controller 240 is configured to stitch planes L_(p) associated with adjacent polarization sensitive pixels together to form a curved surface associated with a composite 3D image of the eye. In various embodiments, the planes L_(p) associated with adjacent pixels are stitched together using a map projection technique (e.g., a rectilinear projection, a cylindrical project, a spherical projection, a Pannini projection, or a stereographic projection). In other embodiments, the one or more surface normals are segmented such that each segment is associated with a plane L_(p) wherein the planes L_(p) may define a curved surface associated with a model component (i.e., cornea, sclera).

The controller 240 determines, for each polarization sensitive pixel in a subset of the plurality of polarization sensitive pixels of the camera 230, one or more estimation parameters associated with the eye tracking system 200. The controller 240 further determines depth information (i.e., absolute depth) for one or more glints associated with one or more surfaces of a plurality of surfaces of the eye, based in part on the determined polarization information and the one or more estimation parameters. Details about determining the one or more estimation parameters and the absolute depth are discussed in conjunction with FIG. 4.

The controller 240 may extract one or more model parameters from the determined 3D shape of the eye and/or the determined depth information. In one or more embodiments, the controller 240 maps a model of the eye to the determined 3D shape of corneal surface and/or the determined depth information to determine one or more model parameters (e.g., sphere radius, sphere centroid, optical axis, etc.). In an embodiment, the controller 240 is configured to segment the determined 3D shape of the eye into one or more segments (e.g., the sclera and corneal surface). Each of the two segments of the 3D shape is mapped to a sphere associated with the model (e.g., eye sphere and corneal sphere). For example, a change in the radius of curvature between different segments of the 3D model of the eye is used to differentiate portions of the 3D shape of the eye corresponding to the two spheres comprising the model of the eye. The controller 240 also extracts other model parameters (e.g., a center location of the corneal sphere, a radius of the eye sphere, a center of the eye sphere, a pupil diameter, and a pupil location) from the determined 3D shape of the eye and/or the depth information.

In one or more embodiments, generating a 3D shape of the eye comprises generating a depth map of the eye 180. A depth map is generated by applying a geometric-based technique. In one or more embodiments, the geometric-based technique may be combined with a photometric approach. In these embodiments, the geometric-based technique removes the ambiguities in the photometric techniques and generates a course depth map of the eye. Some examples of photometric approaches include shape-from-shading (SfS) or uncalibrated photometric stereo (PS). The controller 240 may combine the generated course depth map of the eye with the one or more surface normals to obtain a 3D shape of eye. Methods of combining one or more surface normals with course depth maps to generate a 3D shape of the eye are well known to one skilled in the art.

The controller 240 updates an existing model of the eye (e.g., eye 180) based on the determined depth information. In various embodiments, updating a model includes activating the illumination source 220 and capturing one or more images including eye reflection at the camera 230 while the user is viewing known locations on the HMD 100. The one or more captured images are further processed in order to generate a depth map of the eye. The controller 240 extracts one or more values of the model parameters from the generated depth map (e.g., sphere radius, sphere centroid, etc.). In one or more embodiments, the controller 240 retrieves an existing model of the eye from the data store 210 and determines a difference between the one or more extracted parameter values and the one or more stored parameter values associated with the existing model. Responsive to determining a difference, the controller 240 updates one or more of the model parameter values associated with the existing model of the eye. For example, if the controller 240 determines that the smaller sphere associated with an existing model of the eye previously has a radius of 8 mm and the extracted parameter value for the radius of the smaller sphere is 25 mm, the controller 240 updates the model parameter values associated with the existing model of the eye. Typically, updating the eye model comprises adjusting the eye model parameters stored associated with the model in the data store 210. For example, based on the determined depth information, the controller 240 adjusts one or more model parameter values (e.g., radii and a center of a sphere) associated with the existing eye model. In various embodiments, the existing model of the eye is replaced with the updated model of the eye in the data store 210. The process of updating an existing model is further described below in conjunction with FIGS. 3-5. The updated model of the eye may be used to determine eye tracking information. Methods of determining eye tracking information from an updated model of the eye are described below in conjunction with FIG. 3.

FIG. 3 depicts an example operation of an eye tracking system 300, in accordance with an embodiment. FIG. 3 depicts the eye tracking system 300 imaging an eye of the user (e.g., eye 320). Here the eye 320 is super imposed by an eye model. As illustrated in FIG. 3, the eye tracking system 300 includes an LED 302, and a polarization sensitive camera 304. Here the eye tracking system 300 is an embodiment of the eye tracking system 200. Both the infrared LED 302 and the polarization sensitive camera 304 are embodiments of the illumination source 220 and the camera 230, respectively. For simplification, the discussion of the eye tracking system 300 is about a single eye of the user. However, in some embodiments, a separate eye tracking system 300 is employed for each of the user's eyes 320. In such embodiments, each of the eye tracking systems 300 is configured to update a separate eye model wherein each eye model represents a different eye 320 (e.g., left, right). In such the embodiments, each eye tracking system 300 is additionally configured to communicate amongst themselves and a console associated with the HMD.

In FIG. 3, the polarization sensitive camera 304 is placed such that the eye reflections (including one or more glints) in response to the light from the LED 302 incident upon the eye 320 can be captured over a range of eye movements. In this embodiment, the eye tracking system 300 and the LED 302 are placed off-axis such that they are not in the light of sight of the user. In one or more embodiments, the LED 302 comprises a plurality of LEDs 302. In various embodiments, the LEDs 302 are arranged such that they specifically illuminate the eye 320. For example, the plurality of LEDs 302 may be arranged in a ring formation around an eye 320. In other embodiments, the plurality of LEDs 302 may be arranged in other conformations such that they optimally illuminate with light.

Light 306-I illustrates one of the plurality of light rays emitted by the LED 302. The light 306-I incident upon the eye 320 produces multiple reflections such as a reflection from the outer surface of the cornea and another reflection from the inner surface of the cornea. Light 306-R represents one of the plurality of light rays reflected from the eye 320. In one or more embodiments, the light 306-R is directed towards the polarization sensitive camera 304. In one embodiment, the polarization sensitive camera 304 captures the intensity of the reflected light 306-R and a polarization angle. For example, in FIG. 3, the intensity and polarization angle of the reflected light represented by arrow 306-R is captured in images captured by the polarization sensitive camera 304. Alternatively, or additionally, the polarization sensitive camera 304 captures intensity and the polarization angle associated with the reflected light 306-R from the inner surface of the cornea.

An example normal mode of operation of the eye tracking system 300 includes activating the LED 302 and illuminating the corneal surface of an eye 320 with light 306-I. Illumination of the corneal surface of the eye 320 results in a reflected light ray 306-R directed towards the polarization sensitive camera 304. The reflected light 306-R is captured by the polarization sensitive camera 304. During the normal mode of operation, a controller (e.g., controller 240) determines depth information for one or more glints associated with one or more surfaces of the eye 320. The controller further retrieves a stored eye model associated with the user's eye (e.g., eye 180) from the data store associated with the eye tracking system (e.g., data store 210). Responsive to determining a difference between one or more model parameters associated with the determined depth information and the one or more model parameters associated with the stored eye model, the controller updates the stored eye model and determines eye tracking information (e.g., gaze direction). An example mode of operation of the eye tracking system 200 is further described below in conjunction with FIG. 6.

The eye tracking system 300 updates a model of the eye. The model for the eye, including one or more model parameter values associated with the retrieved model, is retrieved from a data store (e.g., data store 210) described above in conjunction with FIG. 2. In an example state of operation, the eye tracking system 300 retrieves a stored model of a human eye 320 from the data store 210. As depicted in FIG. 3, the eye model 320 comprises two spheres: an eye sphere 324 and a corneal sphere 325. Each of spheres, eye sphere 324 and corneal sphere 325, has different radii such that the radius of eye sphere 324 is larger than that of corneal sphere 325. Here, a portion of the eye sphere 324 approximates the overall eye and a portion of the corneal sphere 325 represents the cornea of the eye. The center (or origin) of the eye sphere 324 is represented by point 328 and the center of corneal sphere 325 is represented by point 326. Element 322 represents the lens of the eye. In other embodiments, the cornea may be modeled as a complex surface.

Reflections of light 306-I from the eye 320 result in light 306-R wherein the polarization state of light 306-R may be different from the polarization state of the light 306-I. The change in polarization state of the eye reflections is captured by the polarization sensitive camera 304. In an embodiment, based on the polarization state of the eye reflections, the eye tracking unit 300 generates a depth map of the corneal surface. The generated depth map of the corneal surface can be used to determine a radius of curvature of the overall eye and the cornea. The radius of curvature of the overall eye and the cornea may be used to determine the radii of spheres 324 and 325, respectively. For example, the curvature of the overall eye, as determined from generated depth map, is used to determine a value for the radius of the eye sphere 324 and radius of curvature of the cornea. In various embodiments, the determined radius of curvature of the cornea and the overall eye are additionally used to determine the relative locations of point 326 and point 328. In still other embodiments, the eye tracking system 300 utilizes intensity data associated with the received eye reflections (e.g., absorption data) to determine a size and shape of element 322.

The eye tracking system 300 updates the eye model 320 based on the determined depth information for the one or more glints associated with one or more surfaces of the eye 320 in order to track the motion of the center of the corneal sphere 325 during the rotation of the eye as the user rotates their eye to look at various objects in the HMD 610. In various embodiments, updating an eye model comprises updating one or more model parameters associated with the eye model 320 based on a determined difference between a stored value associated with a model parameter and determined value of the model parameter. In one or more embodiments, the eye model 320 is updated in real-time as the user rotates their eyes to look at various objects in the HMD 610. An example process for updating a model is described further, below, in conjunction with FIG. 6.

The eye tracking system 300 additionally determines eye tracking information associated with the eye based on one or more model parameters. Eye tracking information includes eye orientation, eye rotation, aspheric eye shape, and eye deformation under rotation. For example, the center 328 of eye sphere 324 is a function of the cornea position and enables determining of aspheric eye shapes, oculomotor muscle control, deformation under rotation, etc. In various embodiments, the eye tracking system 300 determines a gaze direction based on the determined eye tracking information. In various embodiments, eye information is also stored in the data store 210. In still other embodiments, the eye model information is stored in a separate database associated with the eye tracking system 300.

FIG. 4 depicts an example eye tracking system 400 for determining an absolute depth of an eye's surface 405, in accordance with an embodiment. The system 400 shown in FIG. 4 includes a linear polarizer 410, a polarization sensitive camera 415, and at least one unpolarized light source 425 displaced relative to an optical center 430 of the polarization sensitive camera 415. In some embodiments, the eye's surface 405 includes at least a portion of a corneal sphere (e.g., the corneal sphere 325 in FIG. 3), or at least a portion of an eye sphere (e.g., the eye sphere 324). Alternatively, the eye's surface 405 can represent any surface or a portion of surface of the eye 320 (e.g., cornea, sclera, pupil, etc.). The method presented herein utilizes Shape from Polarization (SfP) approach to determine an absolute depth of the eye's surface 405 reflecting a glint related to the light source 425. The absolute depth may be determined with respect to x, y and z dimensions relative to the optical center 430, e.g., a hypothetical pinhole camera aperture of a geometrically calibrated camera. In order to determine the absolute depth of the eye's surface 405, an absolute location of illumination sources of the eye tracking system 400 should be known a priori, as discussed in more detail below.

The linear polarizer 410 is placed in front of the polarization sensitive camera 415. The camera 415 can be an embodiment of the camera 230 in FIG. 2 and/or the camera 304 in FIG. 3. The linear polarizer 410 can rotate, e.g., based on instructions from a controller 420 coupled to the linear polarizer 410, in order to measure image intensity waveform and determine a degree of polarization, as discussed above. The controller 420 can be an embodiment of the controller 308 in FIG. 3 and/or the controller 240 in FIG. 2. A displacement of the unpolarized light source 425 relative to the camera 415 is denoted as D in FIG. 4. In some other embodiments (not shown in FIG. 4), the linear polarizer 410 is replaced with an array of polarization filters having different fixed orientations placed in front of a “super pixel” of the camera 415 that includes three or more native pixels. The camera 415 has a focal length off, as denoted in FIG. 4. The eye's surface 405 at the position (X, Y, Z) with respect to the camera 415 is imaged at a distanced away from an optical center 430 of the camera 415. The controller 420 may be further coupled to the camera 415 to receive polarization information captured by the camera 415.

As discussed above, the controller 420 may determine a degree of linear polarization associated with light captured at each polarization sensitive pixel of the camera 415 based on image intensity waveforms measured as the linear polarizer 410 is rotated and occupies at least three different positions. In one or more embodiments, the degree of linear polarization of the light captured at a polarization sensitive pixel 435 of the polarization based camera 415 is a ratio of the difference between the maximum and minimum intensities and the sum of the maximum and minimum intensities received at a given polarization sensitive pixel. In various embodiments, as discussed, the controller 420 estimates a zenith angle θ of a surface normal using a closed form solution to a Fresnel equation relating the degree of linear polarization and zenith angle. In other embodiments, as discussed, the controller 420 determines the zenith angle θ by various numerical optimization techniques known to one skilled in the art.

In accordance with embodiments of the present disclosure, the zenith angle θ can be then used to uniquely determine a depth Z of the eye's surface 405 reflecting a glint related to the light source 425. The location of the eye's surface 405 with respect to other dimensions (e.g., x and/or y) can be determined after knowing a-priori system parameters, e.g., D and f, and after calculating parameters d and θ.

For each pixel of the polarization based camera 415, an angle α and displacement D are known a priori from a pre-determined camera calibration procedure. As shown in FIG. 4, the angle α is an angle between a ray of light reflected from the eye's surface 405 and an axis of the polarization sensitive pixel 435 capturing the ray of reflected light (or an axis corresponding to the optical center 430). After determining the parameter d for each polarization sensitive pixel 435 of the camera 415 and knowing a focal distance f of the camera 415, the angle α can be obtained for each polarization sensitive pixel 435 from the ratio of d and f, i.e.,

$\begin{matrix} {{\tan\;\alpha} = {\frac{d}{f}.}} & (1) \end{matrix}$

As discussed, the zenith angle θ and the azimuthal angle φ can be measured (e.g., by the controller 420) from the determined degree of linear polarization and the intensity waveform. After determining the angle α based on equation (1), the ratio between the displacement D and the zenith angle θ can be calculated as:

$\begin{matrix} {\frac{D}{\sin\; 2\theta} = {\frac{\sqrt{X^{2} + Z^{2}}}{\sin\left( {\frac{\pi}{2} + \alpha - {2\theta}} \right)} = {\frac{\sqrt{\left( {Z\;\tan\;\alpha} \right)^{2} + Z^{2}}}{\cos\left( {\alpha - {2\theta}} \right)} = {\frac{Z{{\sec\;\alpha}}}{\cos\left( {\alpha - {2\theta}} \right)}.}}}} & (2) \end{matrix}$

Based on equation (2), the absolute depth Z can be determined (e.g., by the controller 420) as:

$\begin{matrix} {Z = {\frac{D \cdot {\cos\left( {\alpha - {2\theta}} \right)}}{\sin\; 2{\theta \cdot {{\sec\;\alpha}}}}.}} & (3) \end{matrix}$ The location X of the eye's surface 405 along x dimension can be determined as: X=Z tan α.  (4)

Thus, the position (X, Z) of a reflection surface along x and z dimensions, e.g., a surface of a cornea, can be determined from the known geometric constraints, partially informed from the polarization-based estimation of the surface normal. Note that the displacement between an aperture of the camera 415 and the at least one illumination source 420 can be any arbitrary displacement and may not always occur in the x-z plane, as illustrated in FIG. 4. In addition, the at least illumination source 420 may not be collinear with the focal plane array. In these cases, the equations (1)-(4) become more complex, but the concept is identical as discussed herein, and can be used to determine the location (X, Y, Z) of the eye's surface 405 with respect to all three space dimensions.

FIG. 5 is a flow chart illustrating an eye tracking process 500, in accordance with an embodiment. The process illustrated in conjunction with FIG. 5 may be performed by the eye tracking system 200. In other embodiments, the process 500 may be performed by different and/or additional modules. Likewise, embodiments may include different and/or additional steps, or perform the steps in different orders.

The eye tracking system 200 illuminates 510 a user's eye (e.g., the eye 180) with light. In an embodiment, the eye is illuminated by one or more illumination sources (e.g., the illumination source 220). The one or more illumination sources emit light which is incident on the eye such that a surface of the eye reflects the incident light. The reflection causes changes in polarization that can be used to determine depth information. In some embodiments, the one or more illumination sources emit polarized light (e.g., linearly polarized). In alternate embodiments, the one or more illumination sources emit unpolarized light.

The eye tracking system 200 captures 520 (e.g., by a polarization sensitive camera having an array of a plurality of polarization sensitive pixels, such as the camera 230) one or more images of the eye including intensity and polarization information of light reflected from at least one surface of the eye. In various embodiments, the captured images of the eye comprise intensity information including one or more polarization angles associated with the reflected light. The determination of polarization angles and intensity information by the eye tracking system 200 is further described above in conjunction with FIGS. 2 and 3.

The eye tracking system 200 determines 530, for each polarization sensitive pixel in a subset of the plurality of polarization sensitive pixels, one or more estimation parameters associated with the eye tracking system. In some embodiments, the one or more estimation parameters determined for each polarization sensitive pixel in the subset include: a displacement between one of the one or more illumination sources and that polarization sensitive pixel, and an angle between a ray of light reflected from a surface of the eye and an axis of that polarization sensitive pixel capturing the ray of reflected light. Determining the one or more estimation parameters is further described, above, in conjunction with FIG. 4.

The eye tracking system 200 determines 540, for the subset of the plurality of polarization sensitive pixels, depth information for one or more glints associated with one or more surfaces of a plurality of surfaces of the eye, based in part on the determined polarization information and the one or more estimation parameters. In some embodiments, the eye tracking system 200 captures for at least three polarization angles, an intensity of light reflected from at least one surface of the eye. The eye tracking system 200 determines an azimuthal angle associated with the reflected light based on the determined polarization angle and the intensity. The eye tracking system 200 determines a degree of polarization of the reflected light based in part on the captured intensity. The eye tracking system 200 determines a zenith angle associated with the reflected light based in part on the determined degree of polarization. The eye tracking system 200 determines depth information for a glint of the one or more glints based on the one or more estimation parameters and the zenith angle. Determining the depth information is further described, above, in conjunction with FIG. 4 and in accordance with the equations (1)-(4).

In some embodiments, the eye tracking system 200 determines a plurality of surface normals associated with the subset of the plurality of polarization sensitive pixels based on the determined intensity and polarization information associated with the reflected light (e.g., light 306-R). The eye tracking system 200 may determine a 3D shape of the eye (e.g., eye 180) based in part on the determined surface normals. Each surface normals is associated with a portion of the surface of the eye being imaged. The surface normal is used to determine a plane in 3D space that is tangent to the surface of the eye at that point on of the eye being imaged. In one or more embodiments, the eye tracking system 200 combines adjacent tangent surfaces to determine the 3D shape of the eye. In various embodiments, the eye tracking system 200 stiches each of the tangent planes together to determine the 3D shape of the eye and extracts one or more model parameter from the determined 3D shape of the eye. Determining a 3D shape the eye is further described, above, in conjunction with FIG. 2.

The eye tracking system 200 updates 550 a model of the eye (e.g., eye 180) using the determined depth information. In some embodiments, the eye tracking system 200 updates 550 the model of the eye by fitting one or more model components of a plurality of model components in the model to the one or more surfaces of the eye, using the depth information for the one or more glints. In various embodiments, the eye tracking system 200 retrieves a stored eye model (e.g., the model 320) from a data store (e.g., the data store 200). In still other embodiments, the eye tracking system 200 retrieves a model of the eye from an external source (e.g., a console associated with the HMD). In alternate embodiments, the eye tracking system 200 generates a model of the eye based on the determined depth information. As the user looks at various objects in the HMD, the eye tracking system 200 may periodically determine the depth information and extract one or more model parameter values (e.g., radii and the relative locations of the spheres) and compare the extracted model parameter values to stored model parameter values. Responsive to determining a difference between one or more of the extracted model parameter values and the stored model parameter values, the eye tracking system 200 updates 550 the model parameter values associated with the stored model. In some embodiments, the stored model of the eye comprises a first sphere that describes a shape of a sclera of the eye and a second sphere that describes a shape of a cornea of the eye. The stored model may be parameterized by a radius associated with each of the two spheres, a location of the centers of each of the two spheres, and a distance between the centers each of the two spheres

The eye tracking system 200 determines 560 eye tracking information based on the updated model. In some embodiments, the eye tracking system 200 determines the eye tracking information based in part on the fitted one or more model components. The eye tracking information includes one or more of a gaze direction, a vergence angle, a vergence depth, and an accommodation depth. In one or more embodiments, the eye tracking information (e.g., user's gaze) is determined in real-time based on the updated eye model. Eye tracking information is described above in conjunction with FIG. 2.

In some embodiments, the eye tracking system 200 is integrated into a HMD, e.g., the HMD 100 in FIG. 1A. The HMD may be part of an artificial reality system. The HMD may further comprise a display unit and an optics block. The display unit is configured to emit content (image light), wherein the emitted content may be based at least in part on the eye tracking information. The optics block is configured to direct the content to an eye-box of the HMD, based at least in part on the eye tracking information.

System Environment

FIG. 6 is a block diagram of a HMD system 600, in accordance with an embodiment. The HMD system 600 may operate in an artificial reality system environment. The HMD system 600 shown by FIG. 6 comprises a HMD 610, an input/output (I/O) interface 620 that are each coupled to a console 630, and an imaging device 640. While FIG. 1 shows an example system 100 including one HMD 610, one I/O interface 620, and one imaging device 640 in other embodiments any number of these components may be included in the HMD system 600. For example, there may be multiple HMDs 610 each having an associated I/O interface 620 and being monitored by one or more imaging devices 640, with each HMD 610, I/O interface 620, and imaging devices 640 communicating with the HMD console 630. In alternative configurations, different and/or additional components may be included in the system environment 600.

The HMD 610 presents content to a user. Examples of content presented by the HMD 610 include one or more images, video, audio, or some combination thereof. In some embodiments, audio is presented via an external device (e.g., speakers and/or headphones) that receives audio information from the HMD 610, the console 630, or both, and presents audio data based on the audio information. An embodiment of the HMD 610 is further described above in conjunction with FIGS. 1A and 1B. The HMD 610 may comprise one or more rigid bodies, which may be rigidly or non-rigidly coupled to each other. A rigid coupling between rigid bodies causes the coupled rigid bodies to act as a single rigid entity. In contrast, a non-rigid coupling between rigid bodies allows the rigid bodies to move relative to each other.

The HMD 610 includes one or more locators 120, one or more position sensors 135, an inertial measurement unit (IMU) 130, an eye tracking system 615, display unit 160, and an optics block 165. The display unit 160, the optics block 165, the one or more locators 120, the one or more position sensors 135, and IMU 130 are described, above, in conjunction with FIGS. 1A and 1B.

The eye tracking system 615 determines depth information for one or more glints associated with one or more surfaces of the eye based on captured polarization information and one or more estimation parameters associated with the eye tracking system 615. The eye tracking system 615 is an embodiment of the eye tracking system 200 described above in conjunction with FIG. 2. The eye tracking system 615 includes one or more illumination sources (e.g., the illumination source 220) and a camera (e.g., the camera 230). The one or more illumination sources are configured to illuminate an outer portion of the user's eye thus generating eye reflections directed towards the camera. In various embodiments, the camera (e.g., the camera 230) is a polarization sensitive camera. The imaging device captures one or more images of the corneal surface including intensity and polarization information associated with the eye reflections. In some embodiments, the eye tracking system 615 determines, for a subset of the polarization sensitive pixels of the camera, depth information for one or more glints associated with one or more surfaces of the eye, based in part on the polarization information, one or more distances between the camera and the one or more illumination sources, and the one or more estimation parameters. In various embodiments, the eye tracking system 615 extracts one or more model parameter values from the determined depth information and updates a stored model of the eye responsive to a difference between a stored model parameter value and an extracted model parameter value. In some embodiments, the eye tracking system 615 updates the model of the eye by fitting one or more model components (model surfaces) of a plurality of model components in the eye model to the one or more surfaces of the eye, using the depth information for the one or more glints. The eye tracking system 615 may determine the eye tracking information based in part on the fitted one or more model components.

The I/O interface 620 is a device that allows a user to send action requests to the HMD console 630. An action request is a request to perform a particular action. For example, an action request may be to start or end an application or to perform a particular action within the application. The I/O interface 620 may include one or more input devices. Example input devices include: a keyboard, a mouse, a game controller, or any other suitable device for receiving action requests and communicating the received action requests to the HMD console 630. An action request received by the I/O interface 620 is communicated to the HMD console 630, which performs an action corresponding to the action request. In some embodiments, the I/O interface 620 may provide haptic feedback to the user in accordance with instructions received from the HMD console 630. For example, haptic feedback is provided when an action request is received, or the HMD console 630 communicates instructions to the I/O interface 620 causing the I/O interface 620 to generate haptic feedback when the HMD console 630 performs an action.

The HMD console 630 provides content to the HMD 610 for presentation to the user in accordance with information received from one or more of: the imaging device 640, the HMD 610, and the I/O interface 620. In the example shown in FIG. 6, the HMD console 630 includes an application store 632, a tracking module 634, and an engine 636. Some embodiments of the HMD console 630 have different modules than those described in conjunction with FIG. 6. Similarly, the functions further described below may be distributed among components of the HMD console 630 in a different manner than is described here.

The application store 632 stores one or more applications for execution by the HMD console 630. An application is a group of instructions, that when executed by a processor, generates content for presentation to the user. Content generated by an application may be in response to inputs received from the user via movement of the HMD 610 or the input interface 640. Examples of applications include: gaming applications, conferencing applications, video playback application, or other suitable applications.

The tracking module 634 calibrates the system 600 using one or more calibration parameters and may adjust one or more calibration parameters to reduce error in determination of the position of the HMD 610. For example, the tracking module 634 adjusts the focus of the imaging device 640 to obtain a more accurate position for observed locators on the HMD 610. Moreover, calibration performed by the tracking module 634 also accounts for information received from the IMU 630. Additionally, if tracking of the HMD 610 is lost (e.g., the imaging device 640 loses line of sight of at least a threshold number of the locators 620), the tracking module 140 re-calibrates some or the entire system environment 600.

The tracking module 634 tracks movements of the HMD 610 using slow calibration information from the imaging device 640. The tracking module 634 determines positions of a reference point of the HMD 610 using observed locators from the slow calibration information and a model of the HMD 610. The tracking module 634 also determines positions of a reference point of the HMD 610 using position information from the fast calibration information. Additionally, in some embodiments, the tracking module 634 may use portions of the fast calibration information, the slow calibration information, or some combination thereof, to predict a future location of the headset 105. The tracking module 634 provides the estimated or predicted future position of the HMD 610 to the engine 636.

The engine 636 executes applications within the system environment 600 and receives position information, acceleration information, velocity information, predicted future positions, or some combination thereof of the HMD 610 from the tracking module 634. Based on the received information, the engine 655 determines content to provide to the HMD 610 for presentation to the user. For example, if the received information indicates that the user has looked to the left, the engine 636 generates content for the HMD 610 that mirrors the user's movement in a virtual environment. Additionally, the engine 636 performs an action within an application executing on the HMD console 630 in response to an action request received from the I/O interface 620 and provides feedback to the user that the action was performed. The provided feedback may be visual or audible feedback via the HMD 610 or haptic feedback via the I/O interface 620.

The engine 636 may be configured to utilize eye tracking information determined by the eye tracking module 615 for a variety of display and interaction applications. The various applications include, but are not limited to, providing user interfaces (e.g., gaze-based selection), attention estimation (e.g., for user safety), gaze-contingent display modes (e.g., foveated rendering, varifocal optics, adaptive optical distortion correction, synthetic depth of field rendering), metric scaling for depth and parallax correction (e.g., IPD and eye-box adjustment), etc. In some embodiments, based on information about position and orientation of the user's eye received from the eye tracking system 634, the engine 636 determines resolution of the content provided to the HMD 610 for presentation to the user on the display unit 615. The engine 636 provides the content to the HMD 610 having a maximum pixel density (maximum resolution) on the display unit 615 in a foveal region of the user's gaze, whereas the engine 636 provides a lower pixel resolution in other regions of the display unit 615, thus achieving less power consumption at the HMD 610 and saving computing cycles of the HMD console 630 without compromising a visual experience of the user. In some embodiments, the engine 636 can be configured to optimize the performance of viewing optics of the HMD 610 (e.g., components of the display unit 160), based on the eye tracking information obtained from the eye tracking system 634. In one embodiment, the engine 636 can adjust optical distortion correction parameters of the viewing optics, e.g., to prevent vergence-accommodation conflict. In an alternate embodiment, the engine 636 can adjust focus of images displayed on the display unit 160, e.g., to prevent vergence-accommodation conflict.

The imaging device 640 generates slow calibration data in accordance with calibration parameters received from the HMD console 630. Slow calibration data includes one or more images showing observed positions of the locators 620 that are detectable by the imaging device 640. The imaging device 640 may include one or more cameras, one or more video cameras, any other device capable of capturing images including one or more of the locators 620, or some combination thereof. Additionally, the imaging device 640 may include one or more filters (e.g., used to increase signal to noise ratio). The imaging device 640 is configured to detect light emitted or reflected from locators 620 in a field of view of the imaging device 640. In embodiments where the locators 620 include passive elements (e.g., a retro-reflector), the imaging device 640 may include a light source that illuminates some or all of the locators 620, which retro-reflect the light towards the light source in the imaging device 640. Slow calibration data is communicated from the imaging device 640 to the HMD console 630, and the imaging device 640 receives one or more calibration parameters from the HMD console 630 to adjust one or more imaging parameters (e.g., focal length, focus, frame rate, ISO, sensor temperature, shutter speed, aperture, etc.).

Additional Configuration Information

The foregoing description of the embodiments of the disclosure has been presented for the purpose of illustration; it is not intended to be exhaustive or to limit the disclosure to the precise forms disclosed. Persons skilled in the relevant art can appreciate that many modifications and variations are possible in light of the above disclosure.

Some portions of this description describe the embodiments of the disclosure in terms of algorithms and symbolic representations of operations on information. These algorithmic descriptions and representations are commonly used by those skilled in the data processing arts to convey the substance of their work effectively to others skilled in the art. These operations, while described functionally, computationally, or logically, are understood to be implemented by computer programs or equivalent electrical circuits, microcode, or the like. Furthermore, it has also proven convenient at times, to refer to these arrangements of operations as modules, without loss of generality. The described operations and their associated modules may be embodied in software, firmware, hardware, or any combinations thereof.

Any of the steps, operations, or processes described herein may be performed or implemented with one or more hardware or software modules, alone or in combination with other devices. In one embodiment, a software module is implemented with a computer program product comprising a computer-readable medium containing computer program code, which can be executed by a computer processor for performing any or all of the steps, operations, or processes described.

Embodiments of the disclosure may also relate to an apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, and/or it may comprise a general-purpose computing device selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a non-transitory, tangible computer readable storage medium, or any type of media suitable for storing electronic instructions, which may be coupled to a computer system bus. Furthermore, any computing systems referred to in the specification may include a single processor or may be architectures employing multiple processor designs for increased computing capability.

Embodiments of the disclosure may also relate to a product that is produced by a computing process described herein. Such a product may comprise information resulting from a computing process, where the information is stored on a non-transitory, tangible computer readable storage medium and may include any embodiment of a computer program product or other data combination described herein.

Finally, the language used in the specification has been principally selected for readability and instructional purposes, and it may not have been selected to delineate or circumscribe the inventive subject matter. It is therefore intended that the scope of the disclosure be limited not by this detailed description, but rather by any claims that issue on an application based hereon. Accordingly, the disclosure of the embodiments is intended to be illustrative, but not limiting, of the scope of the disclosure, which is set forth in the following claims. 

What is claimed is:
 1. An eye tracking system comprising: a polarization sensitive detector configured to: capture one or more images of an eye, and determine polarization information for the one or more images; and a controller configured to: determine depth information for at least one glint associated with at least one surface of the eye using the polarization information and at least one estimation parameter associated with the polarization sensitive detector, and determine a gaze direction for the eye using the depth information.
 2. The eye tracking system of claim 1, wherein the controller is further configured to: generate a model of the eye by fitting at least one component of the model to the at least one surface of the eye using the depth information; and determine the gaze direction for the eye based on the generated model of the eye.
 3. The eye tracking system of claim 1, wherein the polarization sensitive detector comprises an array of polarization sensitive elements.
 4. The eye tracking system of claim 3, wherein at least one polarization sensitive element in the array comprises a plurality of native pixels.
 5. The eye tracking system of claim 4, at least one of the native pixels comprises a polarization filter.
 6. The eye tracking system of claim 1, wherein the controller is further configured to: determine a plurality of surface normals using the polarization information; determine a three-dimensional (3D) shape of the eye using the surface normals; and determine the gaze direction for the eye using the determined 3D shape of the eye.
 7. The eye tracking system of claim 1, wherein the polarization sensitive detector is configured to detect intensities of light reflected from the at least one surface of the eye and captured by the polarization sensitive detector at multiple polarization angles.
 8. The eye tracking system of claim 7, wherein the controller is further configured to: determine a degree of polarization of the reflected light using the detected intensities; determine a zenith angle associated with the reflected light using the determined degree of polarization; and determine the depth information for the at least one glint based on the at least one estimation parameter and the zenith angle.
 9. The eye tracking system of claim 1, wherein the at least one estimation parameter comprises: at least one displacement between at least one illumination source illuminating the at least one surface of the eye and at least one polarization sensitive element of the polarization sensitive detector, and at least one angle between at least one ray of light reflected from the at least one surface of the eye and at least one axis of the at least one polarization sensitive element capturing the at least one ray of reflected light.
 10. The eye tracking system of claim 9, wherein: the polarization sensitive detector is configured to detect intensities of light reflected from the at least one surface of the eye and captured by the polarization sensitive detector at multiple polarization angles; and the controller is further configured to: determine a degree of polarization of the reflected light using the detected intensities, determine a zenith angle associated with the reflected light using the degree of polarization, and determine the depth information for the at least one glint based on the at least one displacement, the at least one angle and the zenith angle.
 11. The eye tracking system of claim 1, wherein the controller is further configured to: extract one or more model parameter values from the depth information; generate a model of the eye using the one or more extracted model parameters values; and determine the gaze direction for the eye based on the generated model of the eye.
 12. The eye tracking system of claim 1, wherein the polarization sensitive detector comprises a dynamic polarizer permeable to light of a defined set of polarization states.
 13. The eye tracking system of claim 1, wherein the eye tracking system is part of a headset.
 14. A method comprising: capturing, at a polarization sensitive detector, one or more images of an eye; determining polarization information for the one or more images; determining depth information for at least one glint associated with at least one surface of the eye using the polarization information and at least one estimation parameter associated with the polarization sensitive detector; and determining a gaze direction for the eye using the depth information.
 15. The method of claim 14, further comprising: generating a model of the eye by fitting at least one component of the model to the at least one surface of the eye using the depth information; and determining the gaze direction for the eye based on the generated model of the eye.
 16. The method of claim 14, further comprising: determining a plurality of surface normals using the polarization information; determining a three-dimensional (3D) shape of the eye using the surface normals; and determining the gaze direction for the eye using the determined 3D shape of the eye.
 17. The method of claim 14, further comprising: detecting intensities of light reflected from the at least one surface of the eye and captured by the polarization sensitive detector at multiple polarization angles; determining a degree of polarization of the reflected light using the detected intensities; determining a zenith angle associated with the reflected light using the determined degree of polarization; and determining the depth information for the at least one glint based on the at least one estimation parameter and the zenith angle.
 18. The method of claim 14, wherein the at least one estimation parameter comprises: at least one displacement between at least one illumination source illuminating the at least one surface of the eye and at least one polarization sensitive element of the polarization sensitive detector, and at least one angle between at least one ray of light reflected from the at least one surface of the eye and at least one axis of the at least one polarization sensitive element capturing the at least one ray of reflected light.
 19. The method of claim 18, further comprising: detecting intensities of light reflected from the at least one surface of the eye and captured by the polarization sensitive detector at multiple polarization angles; determining a degree of polarization of the reflected light using the detected intensities; determining a zenith angle associated with the reflected light using the determined degree of polarization; and determining the depth information for the at least one glint based on the at least one displacement, the at least one angle and the zenith angle.
 20. The method of claim 14, further comprising: extracting one or more model parameter values from the depth information; generating a model of the eye using the one or more extracted model parameters values; and determining the gaze direction for the eye based on the updated model of the eye. 